Exploring the Relationship Between Salmon Populations and Protected Areas

EDS 222 Final Project
MEDS
Author
Affiliation
Published

November 25, 2022

Background

Populations of anadromous Pacific salmon within California are declining and many populations are expected to disappear entirely within the next 100 years (Katz et al., 2013). For this project I focus on two of these species, coho salmon and steelhead. Of the two evolutionary significant units of coho in California, one is federally listed as threatened and the other is endangered (Moyle et al., 2017). For steelhead there are seven distinct population segments within the state, five of which are federally listed as threatened and one of which is endangered (Moyle et al., 2017). Numerous factors have contributed to their decline and are ongoing threats, including drought and other climate factors, hatchery interactions, and over-fishing (Brown et al., 1994). In some areas, habitat loss is thought to be the predominant mechanism of historical decline, driven by dams, agriculture, and logging (Brown et al., 1994). Urban land use has been linked to the decline of salmon over time and has been shown to impact juvenile survival in the Pacific Northwest (Bilby & Mollot, 2008; Paulsen & Fisher, 2001). In California, many salmon populations spawn within areas that are now considered protected, including lands which have previously been ecologically damaged. For example, Redwood Creek watershed, which now partially resides within Redwood National Park, was previously extensively logged (Brown et al., 1994). I am interested in understanding how protected areas impact salmon populations in California, both in terms of population numbers and rate of change over time. I was unable to find research within California that documented the impact of established protected areas on salmon populations. Understanding this relationship can help managers improve land use practices to protect these species.

Data

To adress my questions, I used data from two different sources. For salmon population data, I pulled data from the California Monitoring Plan for salmon and steelhead (CMP), which is a joint effort between the California Department of Fish and Wildlife (CDFW) and NOAA to monitor and assemble data on anadromous salmon populations across California. For this project, I used estimates for spawning adult coho and steelhead, as this was available for many watershed level populations across a number of years. The data spans from 1981 to 2020 for steelhead and 1995 to 2020 for coho. This data was collected yearly at fixed sites for each population. In addition to the count data, I used geospatial data from the CMP which indicated the watershed or sub-watershed extent that was monitored for each population. To determine the amount of protected area within each watershed I used data from the California Protected Areas Database (CPAD) consisting of polygons of areas protected for open space use throughout California.

Data Wrangling

Geospatial

In order to calculate percent protected by watershed, I used spatial intersection to find the overlap between the protected areas in the CPAD database and each monitored watershed. I then divided the total area of overlap by the total watershed area and multiplied by 100 to find the % protected. I removed any protected areas established after 1981, as this was the first year of data available.

General

After finding percent protected for each watershed, I added this as a column in the population count data. I selected only for adult population counts, as these were the most consistent metric of salmon population data collected. For years when adult populations were estimated using more than one method for the same population, I took the average of the estimates. I also removed populations which never had any fish observed during the time period of my analysis. For ease of interpretation, I set the first year with observations as year 0 (1981 for steelhead and 1995 for coho).

Analysis

Initially, I ran a multiple linear regression. Population count was regressed on percent protected and year. I also included an interaction term between year and percent protected to allow the slope of the relationship between year and population estimate to vary based on percent protected.

\(populationcount_i = B_0 + B_1year_i + B_2percentprotected_i + B_3Year_i * percentProtected_i+E_i\)

Code
#stelhead regression
lm(populationcount~percent_protected + year + year:percent_protected, data = steelhead)
#coho regression
lm(populationcount~year + percent_protected + year:percent_protected, data = coho)

This initial analysis was unable normalize between populations or control for inconsistency in data collection. More populations were measured as time increased, and some sites were monitored inconsistently or only for a few years. To control for this I used cross sectional fixed affects. Unfortunately, since percent protected did not vary based on year, it was not able to be separated from population by adding fixed affects to my initial regression equation. Instead, I used fixed effects to look at the interaction effect between year and percent protected. I also used clustered standard error to account for the fact that the same populations were likely to have correlated errors over time.

\(populationcount_i = B_0 + B_1year_i + B_2year_i * percentProtected_i+E_i\)

Code
#steelhead
felm(populationcount~year + year:percent_protected | Population|0|Population, data = steelhead)

#coho
felm(populationcount~percent_protected | year|0|year, data = coho)

Results

Steelhead

Multiregression with interaction

Increase in percent protected has a positive effect on the number of steelhead. When year is 0 (1981) there will be on average 15 more steelhead for each one percentage increase in percent protected. Increase in percent protected decreases the slope of the relationship between year and population count. -0.45 is the difference in the effect of year on steelhead population count for every one increase in protected area. When percent protected is zero on average there will be an increase of 35 fish per year.

Using a significance level of 0.05, \(B_1\) , \(B_2\), and \(B_3\) are statistically significant (table 1). There is evidence to reject the null.

term estimate std.error statistic p.value conf.low conf.high
(Intercept) -724.84 412.23 -1.76 0.079 -1534.01 84.33
percent_protected 14.52 6.37 2.28 0.023 2.01 27.03
year 35.26 12.75 2.77 0.006 10.24 60.28
percent_protected:year -0.45 0.20 -2.27 0.024 -0.85 -0.06

Interaction with population cross-sectional fixed effects

Controlling for differences between populations and clustering standard errors provides similar results. -0.43 is the difference in the effect of year on steelhead population count for every one increase in protected area. \(B_1\) and \(B_2\) are statistically significant, providing evidence to reject the null.

term estimate std.error statistic p.value conf.low conf.high
year 24.75 9.91 2.5 0.015 4.98 44.52
year:percent_protected -0.43 0.16 -2.6 0.011 -0.76 -0.10

Coho

Multi-regression with interaction

The coho results show similar patterns to the steelhead results. However, using a significance level of 0.05, \(B_1\) , \(B_2\), and \(B_3\) are not statistically significant (table 2) . My 95% confidence intervals include 0 and I fail to reject the null.

term estimate std.error statistic p.value conf.low conf.high
(Intercept) 203.32 225.88 0.90 0.368 -240.33 646.97
year 17.56 12.21 1.44 0.151 -6.42 41.53
percent_protected 2.04 3.27 0.62 0.533 -4.38 8.46
year:percent_protected -0.29 0.18 -1.55 0.122 -0.65 0.08

Interaction with population cross-sectional fixed effects

Controlling for differences between populations and clustering standard errors provides similar results. \(B_1\) and \(B_2\) are not statistically significant. My 95% confidence intervals include 0 and I fail to reject the null.

term estimate std.error statistic p.value conf.low conf.high
year 24.55 18.16 1.35 0.183 -11.94 61.03
year:percent_protected -0.37 0.23 -1.59 0.117 -0.83 0.10

Discussion and Limitations

My results indicate that there is a statistically significant decrease in the rate of steelhead populations over time as percent protected increases. It is unclear to me why this might be the case, however it is somewhat concerning. My assumption is that there is omitted variable bias causing this relationship. In order to investigate this it would be useful to include potential drivers of salmon populations (rainfall, stream flow, available habitat etc.). Other potential omitted variables include previously identified trends such as declines from North to South and declines as distance to reach spawning sites increases (Kope & Wainwright, 1998). Incorporating these factors could help identify any correlations between them and both percent protected and population. My results also indicate that increase in percent protected is related to an increase in number of steelhead, however this may be inaccurate as I was unable to control for other differences between populations that may have been driving this relationship.

It is worth noting that the population change over time may not be linear in parameters. Salmon populations in the data fluctuate from year to year. There are also hatchery fish released in California to supplement populations, for both recreational and conservation purposes (Brown et al., 1994). These releases may muddle overall population trends (Kope & Wainwright, 1998).

To further address my question, a better method would be to look at the relationship between protected areas and the rate of overwinter survival for juvenile salmonids. This would help isolate factors specifically related to protected areas, and alleviate concern in non-linear population trends over time. It would also focus on the life-stage that is spent entirely in freshwater habitat, and thereby most likely affected by inland protected areas. Unfortunately, overwinter survival data does not appear to be collected consistently across watersheds in California, which would make further data collection necessary for this analysis.

In terms of protected area data, my data had an extensive (roughly 30%) number of NAs in the year established column, making it unclear whether all of the protected areas I used existed during the whole study period. CPAD also uses a loose definition of protected and includes any lands that were maintained for open space purposes. It could be better to address this question using only protected areas which are specifically managed for biodiversity purposes. This type of data is available through USGS, however I opted not to use this data due to even more extensive NAs in the year established column.

More research is necessary in order to truly understand whether or not protected areas influence California salmon populations. Due to limitations in my study, I do not feel confident making any assumptions at this point.

Citations